NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

AutoFR: Automated Filter Rule Generation for Adblocking

https://doi.org/10.1145/3703836

Le, Hieu; Elmalaki, Salma; Markopoulou, Athina; Shafiq, Zubair (February 2025, ACM Transactions on Privacy and Security)

Adblocking relies on filter lists, which are manually curated and maintained by a community of filter list authors. Filter list curation is a laborious process that does not scale well to a large number of sites or over time. In this article, we introduce AutoFR, a reinforcement learning framework to fully automate the process of filter rule creation and evaluation for sites of interest. We design an algorithm based on multi-arm bandits to generate filter rules that block ads while controlling the trade-off between blocking ads and avoiding visual breakage. We test AutoFR on thousands of sites and show that it is efficient: It takes only a few minutes to generate filter rules for a site of interest. AutoFR is effective: It optimizes filter rules for a particular site that can block 86% of the ads, as compared to 87% by EasyList, while achieving comparable visual breakage. Using AutoFR as a building block, we devise three methodologies that generate filter rules across sites based on: (1) a modified version of AutoFR, (2) rule popularity, and (3) site similarity. We conduct an in-depth comparative analysis of these approaches by considering their effectiveness, efficiency, and maintainability. We demonstrate that some of them can generalize well to new sites in both controlled and live settings. We envision that AutoFR can assist the adblocking community in automatically generating and updating filter rules at scale.
more » « less
Free, publicly-accessible full text available February 28, 2026
A SCIENTIFIC APPROACH TO TECH ACCOUNTABILITY

Hartzog, Woodraw; Jordan, Scott; Choffnes, David; Markopoulou, Athina; Shafiq, Zubair (February 2024, Harvard Journal of Law and Technology (JOLT))

Full Text Available
Tracking, Profiling, and Ad Targeting in the Alexa Echo Smart Speaker Ecosystem

https://doi.org/10.1145/3618257.3624803

Iqbal, Umar; Bahrami, Pouneh Nikkhah; Trimananda, Rahmadi; Cui, Hao; Gamero-Garrido, Alexander; Dubois, Daniel J; Choffnes, David; Markopoulou, Athina; Roesner, Franziska; Shafiq, Zubair (October 2023, ACM Internet Measurement Conference 2023)

Full Text Available
AutoFR: Automated Filter Rule Generation for Adblocking

Le, Hieu; Elmalaki, Salma; Markopoulou, Athina; Shafiq, zubair (January 2023, Proceedings of USENIX Security 2023)

Adblocking relies on filter lists, which are manually curated and maintained by a community of filter list authors. Filter list curation is a laborious process that does not scale well to a large number of sites or over time. In this paper, we introduce AutoFR, a reinforcement learning framework to fully automate the process of filter rule creation and evaluation for sites of interest. We design an algorithm based on multi-arm bandits to generate filter rules that block ads while controlling the trade-off between blocking ads and avoiding visual breakage. We test AutoFR on thousands of sites and we show that it is efficient: it takes only a few minutes to generate filter rules for a site of interest. AutoFR is effective: it generates filter rules that can block 86% of the ads, as compared to 87% by EasyList, while achieving comparable visual breakage. Furthermore, AutoFR generates filter rules that generalize well to new sites. We envision that AutoFR can assist the adblocking community in filter rule generation at scale.
more » « less
Full Text Available
AutoFR: Automated Filter Rule Generation for Adblocking

Le, Hieu; Elmalaki, Salma; Markopoulou, Athina; Shafiq, zubair (January 2023, Proceedings of USENIX Security 2023)

Adblocking relies on filter lists, which are manually curated and maintained by a community of filter list authors. Filter list curation is a laborious process that does not scale well to a large number of sites or over time. In this paper, we introduce AutoFR, a reinforcement learning framework to fully automate the process of filter rule creation and evaluation for sites of interest. We design an algorithm based on multi-arm ban- dits to generate filter rules that block ads while controlling the trade-off between blocking ads and avoiding visual breakage. We test AutoFR on thousands of sites and we show that it is efficient: it takes only a few minutes to generate filter rules for a site of interest. AutoFR is effective: it generates filter rules that can block 86% of the ads, as compared to 87% by EasyList, while achieving comparable visual breakage. Furthermore, AutoFR generates filter rules that generalize well to new sites. We envision that AutoFR can assist the adblocking community in filter rule generation at scale.
more » « less
Full Text Available
TrackerSift: untangling mixed tracking and functional web resources

https://doi.org/10.1145/3487552.3487855

Amjad, Abdul Haddi; Saleem, Danial; Gulzar, Muhammad Ali; Shafiq, Zubair; Zaffar, Fareed (November 2021, In Proceedings of the 21st ACM Internet Measurement Conference (IMC '21))

Full Text Available
CV-Inspector: Towards Automating Detection of Adblock Circumvention

https://doi.org/10.14722/ndss.2021.24055

Le, Hieu; Markopoulou, Athina; Shafiq, Zubair (January 2021, Network and Distributed Systems Security (NDSS) Symposium 2021)
null (Ed.)
The adblocking arms race has escalated over the last few years. An entire new ecosystem of circumvention (CV) services has recently emerged that aims to bypass adblockers by obfuscating site content, making it difficult for adblocking filter lists to distinguish between ads and functional content. In this paper, we investigate recent anti-circumvention efforts by the adblocking community that leverage custom filter lists. In particular, we analyze the anti-circumvention filter list (ACVL), which supports advanced filter rules with enriched syntax and capabilities designed specifically to counter circumvention. We show that keeping ACVL rules up-to-date requires expert list curators to continuously monitor sites known to employ CV services and to discover new such sites in the wild — both tasks require considerable manual effort. To help automate and scale ACVL curation, we develop CV-INSPECTOR, a machine learning approach for automatically detecting adblock circumvention using differential execution analysis. We show that CV-INSPECTOR achieves 93% accuracy in detecting sites that successfully circumvent adblockers. We deploy CV-INSPECTOR on top-20K sites to discover the sites that employ circumvention in the wild.We further apply CV-INSPECTOR to a list of sites that are known to utilize circumvention and are closely monitored by ACVL authors. We demonstrate that CV-INSPECTOR reduces the human labeling effort by 98%, which removes a major bottleneck for ACVL authors. Our work is the first large-scale study of the state of the adblock circumvention arms race, and makes an important step towards automating anti-CV efforts.
more » « less
Full Text Available
Understanding Incentivized Mobile App Installs on Google Play Store

https://doi.org/10.1145/3419394.3423662

Farooqi, Shehroze; Feal, Álvaro; Lauinger, Tobias; McCoy, Damon; Shafiq, Zubair; Vallina-Rodriguez, Narseo (October 2020, Proceedings of the ACM Internet Measurement Conference)
null (Ed.)
Full Text Available
Eluding ML-based Adblockers With Actionable Adversarial Examples

https://doi.org/10.1145/3485832.3488008

Zhu, Shitong; Wang, Zhongjie; Chen, Xun; Li, Shasha; Man, Keyu; Iqbal, Umar; Qian, Zhiyun; Chan, Kevin S.; Krishnamurthy, Srikanth V.; Shafiq, Zubair; et al (December 2021, ACSAC: Annual Computer Security Applications Conference)

Online advertisers have been quite successful in circumventing traditional adblockers that rely on manually curated rules to detect ads. As a result, adblockers have started to use machine learning (ML) classifiers for more robust detection and blocking of ads. Among these, AdGraph which leverages rich contextual information to classify ads, is arguably, the state of the art ML-based adblocker. In this paper, we present a4, a tool that intelligently crafts adversarial ads to evade AdGraph. Unlike traditional adversarial examples in the computer vision domain that can perturb any pixels (i.e., unconstrained), adversarial ads generated by a4 are actionable in the sense that they preserve the application semantics of the web page. Through a series of experiments we show that a4 can bypass AdGraph about 81% of the time, which surpasses the state-of-the-art attack by a significant margin of 145.5%, with an overhead of <20% and perturbations that are visually imperceptible in the rendered webpage. We envision that a4’s framework can be used to potentially launch adversarial attacks against other ML-based web applications.
more » « less
Full Text Available
FlowTrace: A Framework for Active Bandwidth Measurements using In-band Packet Trains

https://doi.org/10.1007/978-3-030-44081-7_3

Ahmed, Adnan; Mok, Ricky; Shafiq, Zubair (January 2020, Passive and Active Measurement Conference (PAM), 2020)

Full Text Available

« Prev Next »

Search for: All records